Overview

Dataset statistics

Number of variables18
Number of observations12330
Missing cells0
Missing cells (%)0.0%
Duplicate rows125
Duplicate rows (%)1.0%
Total size in memory2.9 MiB
Average record size in memory247.1 B

Variable types

NUM14
BOOL2
CAT2

Reproduction

Analysis started2020-04-13 14:04:15.995391
Analysis finished2020-04-13 14:04:48.674861
Versionpandas-profiling v2.5.4
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
Dataset has 125 (1.0%) duplicate rows Duplicates
ExitRates is highly correlated with BounceRatesHigh Correlation
BounceRates is highly correlated with ExitRatesHigh Correlation
Administrative has 5768 (46.8%) zeros Zeros
Administrative_Duration has 5903 (47.9%) zeros Zeros
Informational has 9699 (78.7%) zeros Zeros
Informational_Duration has 9925 (80.5%) zeros Zeros
ProductRelated_Duration has 755 (6.1%) zeros Zeros
BounceRates has 5518 (44.8%) zeros Zeros
PageValues has 9600 (77.9%) zeros Zeros
SpecialDay has 11079 (89.9%) zeros Zeros

Variables

Administrative
Real number (ℝ≥0)

ZEROS
Distinct count27
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.3151662611516626
Minimum0
Maximum27
Zeros5768
Zeros (%)46.8%
Memory size96.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q34
95-th percentile9
Maximum27
Range27
Interquartile range (IQR)4

Descriptive statistics

Standard deviation3.321784106
Coefficient of variation (CV)1.434792897
Kurtosis4.701146249
Mean2.315166261
Median Absolute Deviation (MAD)2.511874992
Skewness1.960357209
Sum28546
Variance11.03424965
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 10.5 12.5 15.5 18.5 27. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 5768 46.8%
 
1 1354 11.0%
 
2 1114 9.0%
 
3 915 7.4%
 
4 765 6.2%
 
5 575 4.7%
 
6 432 3.5%
 
7 338 2.7%
 
8 287 2.3%
 
9 225 1.8%
 
Other values (17) 557 4.5%
 
ValueCountFrequency (%) 
0 5768 46.8%
 
1 1354 11.0%
 
2 1114 9.0%
 
3 915 7.4%
 
4 765 6.2%
 
ValueCountFrequency (%) 
27 1 < 0.1%
 
26 1 < 0.1%
 
24 4 < 0.1%
 
23 3 < 0.1%
 
22 4 < 0.1%
 

Administrative_Duration
Real number (ℝ≥0)

ZEROS
Distinct count3335
Unique (%)27.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean80.81861053933592
Minimum0.0
Maximum3398.75
Zeros5903
Zeros (%)47.9%
Memory size96.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median7.5
Q393.25625
95-th percentile348.2663691
Maximum3398.75
Range3398.75
Interquartile range (IQR)93.25625

Descriptive statistics

Standard deviation176.7791075
Coefficient of variation (CV)2.187356431
Kurtosis50.55673905
Mean80.81861054
Median Absolute Deviation (MAD)98.05916626
Skewness5.615719019
Sum996493.468
Variance31250.85284
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 6.66666666e-01 1.66666667e+00 3.75000000e+00 4.16666667e+00 ... 5.15797738e+02 6.76262255e+02 1.01844333e+03 1.75752381e+03 3.39875000e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 5903 47.9%
 
4 56 0.5%
 
5 53 0.4%
 
7 45 0.4%
 
11 42 0.3%
 
6 41 0.3%
 
14 37 0.3%
 
9 35 0.3%
 
15 33 0.3%
 
10 32 0.3%
 
Other values (3325) 6053 49.1%
 
ValueCountFrequency (%) 
0 5903 47.9%
 
1.333333333 1 < 0.1%
 
2 15 0.1%
 
3 26 0.2%
 
3.5 4 < 0.1%
 
ValueCountFrequency (%) 
3398.75 1 < 0.1%
 
2720.5 1 < 0.1%
 
2657.318056 1 < 0.1%
 
2629.253968 1 < 0.1%
 
2407.42381 1 < 0.1%
 

Informational
Real number (ℝ≥0)

ZEROS
Distinct count17
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.5035685320356853
Minimum0
Maximum24
Zeros9699
Zeros (%)78.7%
Memory size96.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile3
Maximum24
Range24
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.270156426
Coefficient of variation (CV)2.522310957
Kurtosis26.93226626
Mean0.503568532
Median Absolute Deviation (MAD)0.792232148
Skewness4.03646376
Sum6209
Variance1.613297346
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 6.5 7.5 10.5 15. 24. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 9699 78.7%
 
1 1041 8.4%
 
2 728 5.9%
 
3 380 3.1%
 
4 222 1.8%
 
5 99 0.8%
 
6 78 0.6%
 
7 36 0.3%
 
9 15 0.1%
 
8 14 0.1%
 
Other values (7) 18 0.1%
 
ValueCountFrequency (%) 
0 9699 78.7%
 
1 1041 8.4%
 
2 728 5.9%
 
3 380 3.1%
 
4 222 1.8%
 
ValueCountFrequency (%) 
24 1 < 0.1%
 
16 1 < 0.1%
 
14 2 < 0.1%
 
13 1 < 0.1%
 
12 5 < 0.1%
 

Informational_Duration
Real number (ℝ≥0)

ZEROS
Distinct count1258
Unique (%)10.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean34.47239792772304
Minimum0.0
Maximum2549.375
Zeros9925
Zeros (%)80.5%
Memory size96.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile195
Maximum2549.375
Range2549.375
Interquartile range (IQR)0

Descriptive statistics

Standard deviation140.7492944
Coefficient of variation (CV)4.082956304
Kurtosis76.31685309
Mean34.47239793
Median Absolute Deviation (MAD)57.54982408
Skewness7.579184716
Sum425044.6664
Variance19810.36388
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 5.00000000e-01 1.75000000e+00 5.75000000e+00 6.16666667e+00 ... 3.03637500e+02 5.05800000e+02 9.26500000e+02 1.52960000e+03 2.54937500e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 9925 80.5%
 
9 33 0.3%
 
6 26 0.2%
 
10 26 0.2%
 
7 26 0.2%
 
13 23 0.2%
 
12 23 0.2%
 
8 22 0.2%
 
16 22 0.2%
 
11 21 0.2%
 
Other values (1248) 2183 17.7%
 
ValueCountFrequency (%) 
0 9925 80.5%
 
1 3 < 0.1%
 
1.5 1 < 0.1%
 
2 11 0.1%
 
2.5 1 < 0.1%
 
ValueCountFrequency (%) 
2549.375 1 < 0.1%
 
2256.916667 1 < 0.1%
 
2252.033333 1 < 0.1%
 
2195.3 1 < 0.1%
 
2166.5 1 < 0.1%
 

ProductRelated
Real number (ℝ≥0)

Distinct count311
Unique (%)2.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31.731467964314678
Minimum0
Maximum705
Zeros38
Zeros (%)0.3%
Memory size96.5 KiB

Quantile statistics

Minimum0
5-th percentile1
Q17
median18
Q338
95-th percentile109
Maximum705
Range705
Interquartile range (IQR)31

Descriptive statistics

Standard deviation44.4755033
Coefficient of variation (CV)1.401621361
Kurtosis31.21170665
Mean31.73146796
Median Absolute Deviation (MAD)26.85052945
Skewness4.341516416
Sum391249
Variance1978.070394
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.000e+00 5.000e-01 1.500e+00 3.500e+00 8.500e+00 ... 1.825e+02 2.390e+02 3.605e+02 4.445e+02 7.050e+02], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 622 5.0%
 
2 465 3.8%
 
3 458 3.7%
 
4 404 3.3%
 
6 396 3.2%
 
7 391 3.2%
 
5 382 3.1%
 
8 370 3.0%
 
10 330 2.7%
 
9 317 2.6%
 
Other values (301) 8195 66.5%
 
ValueCountFrequency (%) 
0 38 0.3%
 
1 622 5.0%
 
2 465 3.8%
 
3 458 3.7%
 
4 404 3.3%
 
ValueCountFrequency (%) 
705 1 < 0.1%
 
686 1 < 0.1%
 
584 1 < 0.1%
 
534 1 < 0.1%
 
518 1 < 0.1%
 

ProductRelated_Duration
Real number (ℝ≥0)

ZEROS
Distinct count9551
Unique (%)77.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1194.7462199688268
Minimum0.0
Maximum63973.522229999995
Zeros755
Zeros (%)6.1%
Memory size96.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q1184.1375
median598.9369047
Q31464.157213
95-th percentile4300.289077
Maximum63973.52223
Range63973.52223
Interquartile range (IQR)1280.019713

Descriptive statistics

Standard deviation1913.669288
Coefficient of variation (CV)1.601737052
Kurtosis137.1741637
Mean1194.74622
Median Absolute Deviation (MAD)1105.052192
Skewness7.263227683
Sum14731220.89
Variance3662130.143
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 2.50000000e-01 2.83333333e+00 7.55500000e+01 2.24083333e+02 ... 7.01472972e+03 1.00457826e+04 1.45726235e+04 2.59270078e+04 6.39735222e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 755 6.1%
 
17 21 0.2%
 
8 17 0.1%
 
11 17 0.1%
 
15 16 0.1%
 
19 15 0.1%
 
22 15 0.1%
 
12 15 0.1%
 
7 14 0.1%
 
13 14 0.1%
 
Other values (9541) 11431 92.7%
 
ValueCountFrequency (%) 
0 755 6.1%
 
0.5 1 < 0.1%
 
1 2 < 0.1%
 
2.333333333 1 < 0.1%
 
2.666666667 1 < 0.1%
 
ValueCountFrequency (%) 
63973.52223 1 < 0.1%
 
43171.23338 1 < 0.1%
 
29970.46597 1 < 0.1%
 
27009.85943 1 < 0.1%
 
24844.1562 1 < 0.1%
 

BounceRates
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS
Distinct count1872
Unique (%)15.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.02219138047072182
Minimum0.0
Maximum0.2
Zeros5518
Zeros (%)44.8%
Memory size96.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0.0031124675
Q30.0168125585
95-th percentile0.2
Maximum0.2
Range0.2
Interquartile range (IQR)0.0168125585

Descriptive statistics

Standard deviation0.04848832181
Coefficient of variation (CV)2.185007006
Kurtosis7.723159431
Mean0.02219138047
Median Absolute Deviation (MAD)0.02886451968
Skewness2.947855267
Sum273.6197212
Variance0.002351117352
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 1.36500000e-05 5.32624000e-04 1.84994250e-03 1.85507800e-03 ... 1.17692307e-01 1.20714285e-01 1.52777778e-01 1.91666667e-01 2.00000000e-01], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 5518 44.8%
 
0.2 700 5.7%
 
0.066666667 134 1.1%
 
0.028571429 115 0.9%
 
0.05 113 0.9%
 
0.033333333 101 0.8%
 
0.025 100 0.8%
 
0.016666667 99 0.8%
 
0.1 98 0.8%
 
0.04 96 0.8%
 
Other values (1862) 5256 42.6%
 
ValueCountFrequency (%) 
0 5518 44.8%
 
2.73e-05 1 < 0.1%
 
3.35e-05 1 < 0.1%
 
3.83e-05 1 < 0.1%
 
3.94e-05 1 < 0.1%
 
ValueCountFrequency (%) 
0.2 700 5.7%
 
0.183333333 1 < 0.1%
 
0.18 5 < 0.1%
 
0.176923077 1 < 0.1%
 
0.175 1 < 0.1%
 

ExitRates
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count4777
Unique (%)38.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.04307279776650446
Minimum0.0
Maximum0.2
Zeros76
Zeros (%)0.6%
Memory size96.5 KiB

Quantile statistics

Minimum0
5-th percentile0.004567568
Q10.014285714
median0.0251564025
Q30.05
95-th percentile0.2
Maximum0.2
Range0.2
Interquartile range (IQR)0.035714286

Descriptive statistics

Standard deviation0.04859654055
Coefficient of variation (CV)1.128242024
Kurtosis4.017034553
Mean0.04307279777
Median Absolute Deviation (MAD)0.03330133931
Skewness2.148789
Sum531.0875965
Variance0.002361623754
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 8.77965000e-05 1.01801650e-03 3.33159900e-03 3.34625300e-03 ... 1.52380952e-01 1.64583333e-01 1.67708334e-01 1.96153846e-01 2.00000000e-01], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.2 710 5.8%
 
0.1 338 2.7%
 
0.05 329 2.7%
 
0.033333333 291 2.4%
 
0.066666667 267 2.2%
 
0.025 224 1.8%
 
0.04 214 1.7%
 
0.016666667 181 1.5%
 
0.02 167 1.4%
 
0.022222222 152 1.2%
 
Other values (4767) 9457 76.7%
 
ValueCountFrequency (%) 
0 76 0.6%
 
0.000175593 1 < 0.1%
 
0.000250438 1 < 0.1%
 
0.000262123 1 < 0.1%
 
0.000263158 1 < 0.1%
 
ValueCountFrequency (%) 
0.2 710 5.8%
 
0.192307692 1 < 0.1%
 
0.188888889 2 < 0.1%
 
0.186666667 4 < 0.1%
 
0.183333333 2 < 0.1%
 

PageValues
Real number (ℝ≥0)

ZEROS
Distinct count2704
Unique (%)21.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.889257862693592
Minimum0.0
Maximum361.76374189999996
Zeros9600
Zeros (%)77.9%
Memory size96.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile38.16052828
Maximum361.7637419
Range361.7637419
Interquartile range (IQR)0

Descriptive statistics

Standard deviation18.56843661
Coefficient of variation (CV)3.152933195
Kurtosis65.63569361
Mean5.889257863
Median Absolute Deviation (MAD)9.417352744
Skewness6.382964249
Sum72614.54945
Variance344.7868381
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 1.90172710e-02 6.77058064e-01 1.02389280e+01 2.12695215e+01 ... 6.48840910e+01 8.92151487e+01 1.16977229e+02 1.78539373e+02 3.61763742e+02], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 9600 77.9%
 
53.988 6 < 0.1%
 
42.29306752 3 < 0.1%
 
40.27815244 2 < 0.1%
 
12.55885714 2 < 0.1%
 
44.89345937 2 < 0.1%
 
58.9241766 2 < 0.1%
 
16.1585582 2 < 0.1%
 
10.99901844 2 < 0.1%
 
21.2112655 2 < 0.1%
 
Other values (2694) 2707 22.0%
 
ValueCountFrequency (%) 
0 9600 77.9%
 
0.038034542 1 < 0.1%
 
0.067049546 1 < 0.1%
 
0.093546949 1 < 0.1%
 
0.098621403 1 < 0.1%
 
ValueCountFrequency (%) 
361.7637419 1 < 0.1%
 
360.9533839 1 < 0.1%
 
287.9537928 1 < 0.1%
 
270.7846931 1 < 0.1%
 
261.4912857 1 < 0.1%
 

SpecialDay
Real number (ℝ≥0)

ZEROS
Distinct count6
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.061427412814274135
Minimum0.0
Maximum1.0
Zeros11079
Zeros (%)89.9%
Memory size96.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0.6
Maximum1
Range1
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.1989172732
Coefficient of variation (CV)3.238249245
Kurtosis9.91365887
Mean0.06142741281
Median Absolute Deviation (MAD)0.110389993
Skewness3.302666747
Sum757.4
Variance0.03956808156
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.1 0.3 0.5 1. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 11079 89.9%
 
0.6 351 2.8%
 
0.8 325 2.6%
 
0.4 243 2.0%
 
0.2 178 1.4%
 
1 154 1.2%
 
ValueCountFrequency (%) 
0 11079 89.9%
 
0.2 178 1.4%
 
0.4 243 2.0%
 
0.6 351 2.8%
 
0.8 325 2.6%
 
ValueCountFrequency (%) 
1 154 1.2%
 
0.8 325 2.6%
 
0.6 351 2.8%
 
0.4 243 2.0%
 
0.2 178 1.4%
 

Month
Categorical

Distinct count10
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size96.5 KiB
May
3364
Nov
2998
Mar
1907
Dec
1727
Oct
 
549
Other values (5)
1785
ValueCountFrequency (%) 
May 3364 27.3%
 
Nov 2998 24.3%
 
Mar 1907 15.5%
 
Dec 1727 14.0%
 
Oct 549 4.5%
 
Sep 448 3.6%
 
Aug 433 3.5%
 
Jul 432 3.5%
 
June 288 2.3%
 
Feb 184 1.5%
 

Length

Max length4
Mean length3.023357664
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 14 63.6%
 
Uppercase_Letter 8 36.4%
 
ValueCountFrequency (%) 
Latin 22 100.0%
 
ValueCountFrequency (%) 
ASCII 22 100.0%
 

OperatingSystems
Real number (ℝ≥0)

Distinct count8
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.124006488240065
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Memory size96.5 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median2
Q33
95-th percentile3
Maximum8
Range7
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.9113248287
Coefficient of variation (CV)0.4290593432
Kurtosis10.45684261
Mean2.124006488
Median Absolute Deviation (MAD)0.6040751989
Skewness2.066285042
Sum26189
Variance0.8305129434
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1. 1.5 2.5 3.5 4.5 7.5 8. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2 6601 53.5%
 
1 2585 21.0%
 
3 2555 20.7%
 
4 478 3.9%
 
8 79 0.6%
 
6 19 0.2%
 
7 7 0.1%
 
5 6 < 0.1%
 
ValueCountFrequency (%) 
1 2585 21.0%
 
2 6601 53.5%
 
3 2555 20.7%
 
4 478 3.9%
 
5 6 < 0.1%
 
ValueCountFrequency (%) 
8 79 0.6%
 
7 7 0.1%
 
6 19 0.2%
 
5 6 < 0.1%
 
4 478 3.9%
 

Browser
Real number (ℝ≥0)

Distinct count13
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.357096512570965
Minimum1
Maximum13
Zeros0
Zeros (%)0.0%
Memory size96.5 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median2
Q32
95-th percentile5
Maximum13
Range12
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.717276676
Coefficient of variation (CV)0.7285559443
Kurtosis12.74673269
Mean2.357096513
Median Absolute Deviation (MAD)1.003084664
Skewness3.242349611
Sum29063
Variance2.94903918
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1.5 2.5 3.5 4.5 ... 8.5 9.5 10.5 12.5 13. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2 7961 64.6%
 
1 2462 20.0%
 
4 736 6.0%
 
5 467 3.8%
 
6 174 1.4%
 
10 163 1.3%
 
8 135 1.1%
 
3 105 0.9%
 
13 61 0.5%
 
7 49 0.4%
 
Other values (3) 17 0.1%
 
ValueCountFrequency (%) 
1 2462 20.0%
 
2 7961 64.6%
 
3 105 0.9%
 
4 736 6.0%
 
5 467 3.8%
 
ValueCountFrequency (%) 
13 61 0.5%
 
12 10 0.1%
 
11 6 < 0.1%
 
10 163 1.3%
 
9 1 < 0.1%
 

Region
Real number (ℝ≥0)

Distinct count9
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.1473641524736413
Minimum1
Maximum9
Zeros0
Zeros (%)0.0%
Memory size96.5 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median3
Q34
95-th percentile8
Maximum9
Range8
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.401591237
Coefficient of variation (CV)0.7630484178
Kurtosis-0.1486803001
Mean3.147364152
Median Absolute Deviation (MAD)1.933807362
Skewness0.9835491595
Sum38807
Variance5.767640468
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1. 1.5 2.5 3.5 4.5 5.5 7.5 8.5 9. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 4780 38.8%
 
3 2403 19.5%
 
4 1182 9.6%
 
2 1136 9.2%
 
6 805 6.5%
 
7 761 6.2%
 
9 511 4.1%
 
8 434 3.5%
 
5 318 2.6%
 
ValueCountFrequency (%) 
1 4780 38.8%
 
2 1136 9.2%
 
3 2403 19.5%
 
4 1182 9.6%
 
5 318 2.6%
 
ValueCountFrequency (%) 
9 511 4.1%
 
8 434 3.5%
 
7 761 6.2%
 
6 805 6.5%
 
5 318 2.6%
 

TrafficType
Real number (ℝ≥0)

Distinct count20
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.069586374695864
Minimum1
Maximum20
Zeros0
Zeros (%)0.0%
Memory size96.5 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median2
Q34
95-th percentile13
Maximum20
Range19
Interquartile range (IQR)2

Descriptive statistics

Standard deviation4.02516916
Coefficient of variation (CV)0.9890855703
Kurtosis3.479710597
Mean4.069586375
Median Absolute Deviation (MAD)2.902031916
Skewness1.962986732
Sum50178
Variance16.20198677
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1.5 2.5 3.5 4.5 ... 14.5 15.5 17.5 19.5 20. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2 3913 31.7%
 
1 2451 19.9%
 
3 2052 16.6%
 
4 1069 8.7%
 
13 738 6.0%
 
10 450 3.6%
 
6 444 3.6%
 
8 343 2.8%
 
5 260 2.1%
 
11 247 2.0%
 
Other values (10) 363 2.9%
 
ValueCountFrequency (%) 
1 2451 19.9%
 
2 3913 31.7%
 
3 2052 16.6%
 
4 1069 8.7%
 
5 260 2.1%
 
ValueCountFrequency (%) 
20 198 1.6%
 
19 17 0.1%
 
18 10 0.1%
 
17 1 < 0.1%
 
16 3 < 0.1%
 

VisitorType
Categorical

Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size96.5 KiB
Returning_Visitor
10551
New_Visitor
 
1694
Other
 
85
ValueCountFrequency (%) 
Returning_Visitor 10551 85.6%
 
New_Visitor 1694 13.7%
 
Other 85 0.7%
 

Length

Max length17
Mean length16.09294404
Min length5
ValueCountFrequency (%) 
Lowercase_Letter 11 68.8%
 
Uppercase_Letter 4 25.0%
 
Connector_Punctuation 1 6.2%
 
ValueCountFrequency (%) 
Latin 15 93.8%
 
Common 1 6.2%
 
ValueCountFrequency (%) 
ASCII 16 100.0%
 

Weekend
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size12.2 KiB
False
9462
True
2868
ValueCountFrequency (%) 
False 9462 76.7%
 
True 2868 23.3%
 

Revenue
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size12.2 KiB
False
10422
True
 
1908
ValueCountFrequency (%) 
False 10422 84.5%
 
True 1908 15.5%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Missing values

Sample

First rows

AdministrativeAdministrative_DurationInformationalInformational_DurationProductRelatedProductRelated_DurationBounceRatesExitRatesPageValuesSpecialDayMonthOperatingSystemsBrowserRegionTrafficTypeVisitorTypeWeekendRevenue
000.000.010.0000000.2000000.2000000.00.0Feb1111Returning_VisitorFalseFalse
100.000.0264.0000000.0000000.1000000.00.0Feb2212Returning_VisitorFalseFalse
200.000.010.0000000.2000000.2000000.00.0Feb4193Returning_VisitorFalseFalse
300.000.022.6666670.0500000.1400000.00.0Feb3224Returning_VisitorFalseFalse
400.000.010627.5000000.0200000.0500000.00.0Feb3314Returning_VisitorTrueFalse
500.000.019154.2166670.0157890.0245610.00.0Feb2213Returning_VisitorFalseFalse
600.000.010.0000000.2000000.2000000.00.4Feb2433Returning_VisitorFalseFalse
710.000.000.0000000.2000000.2000000.00.0Feb1215Returning_VisitorTrueFalse
800.000.0237.0000000.0000000.1000000.00.8Feb2223Returning_VisitorFalseFalse
900.000.03738.0000000.0000000.0222220.00.4Feb2412Returning_VisitorFalseFalse

Last rows

AdministrativeAdministrative_DurationInformationalInformational_DurationProductRelatedProductRelated_DurationBounceRatesExitRatesPageValuesSpecialDayMonthOperatingSystemsBrowserRegionTrafficTypeVisitorTypeWeekendRevenue
1232000.0000.08143.5833330.0142860.0500000.0000000.0Nov2231Returning_VisitorFalseFalse
1232100.0000.060.0000000.2000000.2000000.0000000.0Nov1841Returning_VisitorFalseFalse
12322676.2500.0221075.2500000.0000000.0041670.0000000.0Dec2242Returning_VisitorFalseFalse
12323264.7500.0441157.9761900.0000000.0139530.0000000.0Nov22110Returning_VisitorFalseFalse
1232400.0010.016503.0000000.0000000.0376470.0000000.0Nov2211Returning_VisitorFalseFalse
123253145.0000.0531783.7916670.0071430.02903112.2417170.0Dec4611Returning_VisitorTrueFalse
1232600.0000.05465.7500000.0000000.0213330.0000000.0Nov3218Returning_VisitorTrueFalse
1232700.0000.06184.2500000.0833330.0866670.0000000.0Nov32113Returning_VisitorTrueFalse
12328475.0000.015346.0000000.0000000.0210530.0000000.0Nov22311Returning_VisitorFalseFalse
1232900.0000.0321.2500000.0000000.0666670.0000000.0Nov3212New_VisitorTrueFalse